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Abstract. In mean-payoff games, the objective of the protagonist is to ensure that the limit average 
of an infinite sequence of numeric weights is nonnegative. In energy games, the objective is to ensure 
J.—S , that the running sum of weights is always nonnegative. Generalized mean-payoff and energy games 

fvj ' replace individual weights by tuples, and the limit average (resp. running sum) of each coordinate must 

be (resp. remain) nonnegative. These games have applications in the synthesis of resource- bounded 
^ . processes with multiple resources. 

^^ ' We prove the finite-memory determinacy of generalized energy games and show the inter-reducibility of 

generalized mean-payoff and energy games for finite-memory strategies. We also improve the computa- 
tional complexity for solving both classes of games with finite-memory strategies: while the previously 
best known upper bound was EXPSPACE, and no lower bound was known, we give an optimal coNP- 
complete bound. For memory less strategies, we show that the problem of deciding the existence of a 
winning strategy for the protagonist is NP-complete. 
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Graph games and multi-objectives. Two-player games on graphs are central in many applications 



Q^ I of computer science. For example, in the synthesis problem, implementations are obtained from 

^ ■ winning strategies in games with a qualitative objective such as w-regular specifications [17, 16, 1]. 

In these applications, the games have a qualitative (boolean) objective that determines which player 



wins. On the other hand, games with quantitative objective which are natural models in economics 
f^ ' (where players have to optimize a real-valued payoff) have also been studied in the context of 

^^ . automated design [18,9, 19]. In the recent past, there has been considerable interest in the design 

of reactive systems that work in resource-constrained environments (such as embedded systems). 
The specifications for such reactive systems are quantitative, and these give rise to quantitative 
^ ■ games. In most system design problems, there is no unique objective to be optimized, but multiple, 

potentially conflicting objectives. For example, in designing a computer system, one is interested 
not only in minimizing the average response time but also the average power consumption. In 
this work we study such multi-objective generalizations of the two most widely used quantitative 
objectives in games, namely, mean-payoff and energy objectives [10,19,6,3]. 

Generalized mean-payoff games. A generalized mean-payoff game is played on a finite weighted 
game graph by two players. The vertices of the game graph are partitioned into positions that 
belong to Player 1 and positions that belong to Player 2. Edges of the graphs are labeled with 
/c-dimensional vectors w of integer values, i.e., w ^ 'L . The game is played as follows. A pebble 
is placed on a designated initial vertex of the game graph. The game is played in rounds in which 
the player owning the position where the pebble lies moves the pebble to an adjacent position of 
the graph using an outgoing edge. The game is played for an infinite number of rounds, resulting 
in an infinite path through the graph, called a play. The value associated to a play is the mean 
value in each dimension of the vectors of weights labeling the edges of the play. Accordingly, the 



winning condition for Player 1 is defined by a vector of integer values v (z TJ' that specifies a 
threshold for each dimension. A play is winning for Player 1 if its vector of mean values is at 
least V. All other plays are winning for Player 2, thus the game is zero-sum. We are interested in 
the problem of deciding the existence of a finite-memory winning strategy for Player 1 in generalized 
mean-payoff games. Note that in general infinite memory may be required to win generalized mean- 
payoff games, but for practical applications such as the synthesis of reactive systems with multiple 
resource constraints, the generalized mean-payoff games with finite memory is the relevant model. 
Moreover, they provide the framework for the synthesis of specifications defined by [2,8], and the 
synthesis question for such specifications under regular (ultimately periodic) words correspond to 
generalized mean-payoff games with finite-memory strategies. 

Generalized energy games. In generalized energy games, the winning condition for Player 1 requires 
that, given an initial credit vq G N^, the sum of vq and all the vectors labeling edges up to position 
i in the play is nonnegative, for all i € N. The decision problem for generalized energy games 
asks whether there exists an initial credit vq and a strategy for Player 1 to maintain the energy 
nonnegative in all dimensions against all strategies of Player 2. 

Contributions. In this paper, we study the strategy complexity and computational complexity of 
solving generalized mean-payoff and energy games. Our contributions are as follows. 
First, we show that generalized energy and mean-payoff games are determined when played with 
finite-memory strategies, however, they are not determined for memoryless strategies. For gener- 
alized energy games determinacy under finite-memory coincides with determinacy under arbitrary 
strategies (each player has a winning strategy iff he has a finite- memory winning strategy). In 
contrast, we show for generalized mean-payoff games that determinacy under finite-memory and 
determinacy under arbitrary strategies do not coincide. Thus with finite-memory strategies these 
games are determined, they correspond to the synthesis question with ultimately periodic words, 
and enjoy pleasant mathematical properties like existence of the limit of the mean value of the 
weights, and hence we focus on the study of generalized mean-payoff and energy games with finite- 
memory strategies. 

Second, we show that under the hypothesis that both players play either finite-memory or memo- 
ryless strategies, the generalized mean-payoff game and the generalized energy game problems are 
equivalent. 

Third, our main contribution is the study of the computational complexity of the decision problems 
for generalized mean-payoff games and generalized energy games, both for finite-memory strategies 
and the special case of memoryless strategies. Our complexity results can be summarized as follows: 
(A) For finite-memory strategies, we provide a nondeterministic polynomial time algorithm for 
deciding negative instances of the problems^. Thus we show that the decision problems are in coNP. 
This significantly improves the complexity as compared to the EXPSPACE algorithm that can be 
obtained by reduction to Vass (vector addition systems with states) [4]. Furthermore, we establish 
a coNP lower bound for these problems by reduction from the complement of the 3SAT problem, 
hence showing that the problem is coNP-complete. (B) For the case of memoryless strategies, as the 
games are not determined, we consider the problem of determining if Player 1 has a memoryless 
winning strategy. First, we show that the problem of determining if Player 1 has a memoryless 
winning strategy is in NP, and then show that the problem is NP-hard (i) even when the weights 
are restricted to { — 1, 0, 1}; or (ii) when the weights are arbitrary and the dimension is 2. 



■* Negative instances are those where Player 1 is losing, and by determinacy under finite-memory where Player 2 is 
winning. 



Related works. Mean-payoff games, which are the one-dimension version of our generahzed mean- 
payoff games, have been extensively studied starting with the works of Ehrenfeucht and Mycielski 
in [10] where they prove memory less determinacy for these games. Because of memory less determi- 
nacy, it is easy to show that the decision problem for mean-payoff games lies in NP n coNP, but 
despite large research efforts, no polynomial time algorithm is known for that problem. A pseudo- 
polynomial time algorithm has been proposed by Zwick and Paterson in [19], and improved in [5]. 
The one-dimension special case of generalized energy games have been introduced in [6] and further 
studied in [3] where log-space equivalence with classical mean-payoff games is established. 

Generalized energy games can be viewed as games played on Vass (vector addition systems with 
states) where the objective is to avoid unbounded decreasing of the counters. A solution to such 
games on Vass is provided in [4] (see in particular Lemma 3.4 in [4]) with a PSPACE algorithm 
when the weights are {—1, 0, 1}, leading to an EXPSPACE algorithm when the weights are arbitrary 
integers. We drastically improve the EXPSPACE upper-bound by providing a coNP algorithm for 
the problem, and we also provide a coNP lower bound even when the weights are restricted to 
{-1,0,1}. 

2 Generalized Mean-payoff and Energy Games 

Well quasi-orders. Let D be a set. A relation ^ over D is a well quasi-order, wqo for short, if the 
following holds: (a) :< is transitive and reflexive; and (b) for all / : N — )■ D, there exists ii,i2 G N 
such that ii < ^2 and f{ii) ^ /(^2)- 

Lemma 1. (N'^, <) is well quasi- ordered. 

Multi-weigthed t'wo-player game structures. A multi-weigthed two-player game structure is 
a tuple G = {Si, S2, Sinit, E, k, w) where Si (1 S2 = 0, and Si {i = 1,2) is the finite set of Player i 
positions, Sjnit G Si is the initial position, E C (^i U 52) x {Si U S2) is the set of edges such that 
for all s € S*! U S2, there exists s' G 5*1 U 5*2 such that {s, .s') £ E, k G N is the dimension of the 
multi-weights, w : E ^- Z'' is the multi-weight labeling function. G is a multi-weighted one-player 
game structure if 5*2 = 0. 

A play in G is an infinite sequence of vr = sqSi . . . s„ . . . such that (i) sq = Sjnit, {H) for alH > 
we have {si, Sj+i) £ E. K. play vr = sqSi . . . Sn ■ ■ ■ is ultimately periodic if it can be decomposed as 
TT = pi ■ P2 where pi and p2 are two finite sequences of positions. The prefix up to position n of a 
play vr = sqSi . . . s„ . . . is the finite sequence 7r(n) = sqSi . . . s„, its last element s„ is denoted by 
Last(7r(n)). A prefix 7r(n) belongs to Player i (i € {1,2}) if Last(7r(n)) € Si. The set of plays in G 
is denoted by Plays(G), the corresponding set of prefixes is denoted by Prefs(G), the set of prefixes 
that belongs to Player i (i € {1,2}) is denoted by Prefsj(G), and the set of ultimately periodic 
plays in G is denoted by Plays"*'(G). 

The energy level vector of a prefix of play p = sqSi . . . s„ is EL(/9) = X^^Iq" w{si, Sj+i), and the 
mean-payoff vector oi an ultimately periodic play tt = sqSi . . . s„ . . . is MP(7r) = lim„_!.oo ~EL(7r(n)). 

Strategies. A strategy for Player i {i £ {1,2}) in G is a function Aj : Prefsj(G) — > 6*1 U 6*2 such 
that for all p € Prefsi(G) we have (Last(p), Aj(p)) £ E. K play vr = sqSi • • • € Plays(G) is consistent 
with a strategy Aj of Player i if Sj+i = Aj(soSi . . . Sj) for all j > such that Sj € Si. The outcome 
of a pair of strategies, Ai for Player 1 and A2 for Player 2, is the (unique) play which is consistent 
with both Ai and A2. We denote outcomeG(Ai, A2) this outcome. 



A strategy Ai for Player 1 has finite-memory if it can be encoded by a deterministic Moore 
machine (M, mO) OujOn) where M is a finite set of states (the memory of the strategy), niQ G M 
is the initial memory state, a^j : M x {Si U 52) — )■ M is an update function, and Un '■ M x Si ^ 
Si U 5*2 is the next-action function. If the game is in a Player-1 position s ^ Si and m a M 
is the current memory value, then the strategy chooses s' = an{m,s) as the next position and 
the memory is updated to auijn^s). Formally, {M,mo,au,an) defines the strategy A such that 
X{p ■ s) = aniau{mo, p),s) for all p G {Si U ^2)* and s G Si, where a„ extends «„ to sequences 
of positions as expected. A strategy is m,em,oryless if \M\ = 1. For a finite- memory strategy Ai of 
Player 1, let Gx-^ be the graph obtained as the product of G with the Moore machine defining Ai, 
with initial vertex (mg, Sinit) and where {{m, s), {m' , s')) is a transition in Gx-^ if m' = au{m, s), and 
either s G Si and s' = anim, s), 01 s £ S2 and (s, s') £ E. The set of inifinite paths in Gxi and the 
set of plays consistent with Ai coincide. A similar definition can be given for the case of Player 2. 

Objectives. An objective for Player 1 in G is a set of plays W C Plays(G). A strategy Ai for 
Player 1 is winning for VF in G if for all plays in vr € Plays(G) that are consistent with Ai, we have 
that vr G W. A strategy A2 for Player 2 is spoiling for VF in G if for all plays in vr € Plays(G) that 
are consistent with A2, we have that vr W. We consider the following objectives: 

^ Multi Energy objectives. Given an initial energy vector vq G N^, the multi energy objective 
PosEnergy(^(fo) = {vr G Plays(G) | Vn > : vq + EL(7r(n)) G N'^} requires that the energy level 
in all dimensions remains always nonnegative. 

— Multi Mean-payoff objectives. Given a threshold vector w G Z*^, the multi mean-payoff objective 
MeanPayoff(j(f) = {vr G Plays"^(G) | MP(7r) > v} requires for all dimensions j the mean-payoff 
for dimension j is at least v{j). 

Decision problems. We consider the following decision problems: 

— The unknown initial credit problem asks, given an multi-weighted two-player game structure G, 
to decide whether there exists an initial credit vector vq G N^ and a winning strategy Ai for 
Player 1 for the objective PosEnergyQ(t;o). 

— The mean-payoff threshold problem (for finite memory) asks, given an multi-weighted two-player 
game structure G and a threshold vector v G Z^, to decide whether there exists a finite- 
memory strategy Ai for Player 1 such that for all finite-memory strategies A2 of Player 2, 
outcomeG'(Ai, A2) G MeanPayoff (;.(«). 

Note that in the unknown initial credit problem, we allow arbitrary strategies (and we show in 
Theorem 2 that actually finite- memory strategies are sufficient), while in the mean-payoff threshold 
problem, we require finite-memory strategy which is restriction (according to Theorem 4) of a more 
general problem of deciding the existence of arbitrary winning strategies. 

Determinacy and determinacy under finite-memory. A game G with an objective W is 
determined if either Player 1 has a winning strategy, or Player 2 has a spoiling strategy. A game G 
with an objective W is determined under finite-memory if either (a) Player 1 has a finite-memory 
strategy Ai such that for all finite-memory strategies A2 of Player 2, we have outcomeG(Ai, A2) G W; 
or (b) Player 2 has a finite-memory strategy A2 such that for all finite-memory strategies Ai 
of Player 1, we have outcomeG(Ai, A2) ^ W. Games with objectives W are determined (resp. 
determined under finite-memory) if all game structures with objectives W are determined (resp. 
determined under finite- memory) . We say that determinacy and determined under finite- memory 
coincide for a class of objectives, if for all objectives in the class and all game structures, the answer 



of the determinacy and determined under finite-memory coincide (i.e., Player 1 has a winning 
strategy iff^ there is a finite-memory winning strategy, and similarly for Player 2). Generalized 
mean-payoff and energy objectives are measurable: (a) generalized mean-payoff objectives can be 
expressed as finite intersection of mean-payoff objectives and mean-payoff objectives are complete 
for the third level of Borel hierarchy [7]; and (b) generalized energy objectives can be expressed as 
finite intersection of energy objectives, and enery objectives are closed sets. Hence determinacy of 
generalized mean-payoff and energy games follows from the result of [14]. 

Theorem 1 (Determinacy [14]). Generalized mean-payoff and energy games are determined. 

3 Determinacy under Finite-memory and Inter-reducibility 

In this section, we establish four results. First, we show that to win generalized energy games, it 
is sufficient for Player 1 to play finite-memory strategies. Second, we show that to spoil general- 
ized energy games, it is sufficient for Player 2 to play memoryless strategies. As a consequence, 
generalized energy games are determined under finite-memory. Third, using this finite-memory de- 
terminacy result, we show that the decision problems for generalized energy and mean-payoff games 
(see Section 2) are log-space inter-reducible. Finally, we show that infinite-memory strategies are 
more powerful than finite- memory strategies in generalized mean-payoff games. 

For generalized energy games, we first show that finite-memory strategies are sufficient for 
Player 1, and then that memoryless strategies are sufficient for Player 2. 

Lemma 2. For all multi-weighted two-player game structures G, the answer to the unknown initial 
credit problem is Yes iff there exists a initial credit vq € N'^ and a finite-memory strategy A™ for 
Player 1 such that for all strategies A2 of Player 2, outcomeG(A™, A2) € PosEnergyQ(wo). 

Proof. One direction is trivial. For the other direction, assume that Ai is a (not necessary finite- 
memory) winning strategy for Player 1 in G with initial credit f G N . We show how to construct 
from Ai a finite-memory strategy A™ which is winning against all strategies of Player 2 for initial 
credit vq. For that we consider the unfolding of the game graph G in which Player 1 plays according 
to Ai. This infinite tree, noted 7b(Ai), has as set of nodes all the prefixes of plays in G when 
Player 1 plays according to Ai. We associate to each node p = sqSi . . . Sn in this tree the energy 
vector Vq + EL(p). As Ai is winning, we have that vq + EL(p) G N for all p. Now, consider the 
set {Si U S2) X N'^, and the relation C on this set defined as follows: {si,vi) C (^2,^2) iff si = S2 
and vi < V2 i.e., for all i, 1 < i < k, vi{i) < V2{i). The relation C is a wqo (easy consequence 
of Lemma 1). As a consequence, on every infinite branch tt = sqSi . . . s„ . . . of T^m^), there exists 
two positions i < j such that Last(7r(i)) = Last(7r(j)) and EL(7r(i)) < EL(7r(j)). We say that node 
j subsumes node i. Now, let T^n, ) be the tree T'g(Ai) where we stop each branch when we reach 
a node n2 which subsumes one of its ancestor node ui. Clearly, Tfjfy^ -, is finite. Also, it is easy to 
see that Player 1 can play in the subtree rooted at n2 as she plays in the subtree rooted in ni 
because its energy level in 722 is greater than in ui. From T^/^ •>, we can construct a Moore machine 

which encode a finite-memory strategy A™ which is winning the generalized energy game G as it 
is winning for initial energy level vq. 

Lemma 3. [4-] For all multi-weigthed two-player game structures G, the answer to the unknown 
initial credit problem is No if and only if there exists a memoryless strategy A2 for Player 2, 
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Fig. 1. Player 1 (round states) wins with initial credit (2,0) when Player 2 (square states) can use memoryless 
strategies, but not when Player 2 can use arbitrary strategies. 



such that for all initial credit vectors vq € N and all strategies Ai for Player 1 we have 
outcomeG(Ai,A2) PosEnergyG'(7;o). 

Proof. The proof was given in [4] [Lemma 19]. Intuitively, consider a Player-2 state s G 5*2 with 
two sucessors s', s". If an initial credit vector v'q is sufficient for Player 1 to win against Player-2 
always choosing s', and Vq is sufficient against Player-2 always choosing s", then v'q + Vq is sufficient 
against Player-2 arbitrarily alternating between s' and s". This is because of the fact that if Player 1 
maintains all energies nonnegative when initial credit is vq, then he can maintain all energies above 
A when initial credit is vq + A {A G N'^). 

As a consequence of the two previous lemmas, we have the following theorem. 

Theorem 2. Generalized energy games are determined under finite-memory, and determinacy co- 
incide with determinacy under finite-memory for generalized energy games. 

Remark 1 . Note that even if Player 2 can be restricted to play memoryless strategies in generalized 
energy games, it may be that Player 1 is winning with some initial credit vector vq when Player 2 
is memoryless, and is not winning with the same initial credit vector vq when Player 2 can use 
arbitrary strategies. This situation is illustrated in Fig. 1 where Player 1 (owning round states) can 
maintain the energy nonegative in all dimensions with initial credit (2, 0) when Player 2 (owning 
square states) is memoryless. Indeed, either Player 2 chooses the left edge from qq to qi and Player 1 
wins, or Player 2 chooses the right edge from qq to ^2) and Player 1 wins as well by alternating the 
edges back to go- Now, if Player 2 has memory, then Player 2 wins by choosing first the right edge 
to q2, which forces Player 1 to come back to qq with multi- weight (—1, 1). The energy level is now 
(1,1) in go and Player 2 chooses the left edge to gi which is losing for Player 1. Note that Player 1 
wins with initial credit (2, 1) and (3, 0) (or any larger credit) against all arbitrary strategies of 
Player 2. 

We now show that generalized mean-payoff games (where players are restricted to play finite- 
memory strategies by definition) are log-space equivalent to generalized energy games. First note 
that the mean-payoff threshold problem with threshold vector v ^Ir can be reduced to the mean- 
payoff threshold problem with threshold vector {O}'^, by shifting all multi- weights in the game 
graph by v (which has the effect of shifting the mean-payoff value by v). Given this reduction, the 
following result shows that the unknown initial credit problem (for multi-energy games) and the 
mean-payoff threshold problem (with finite- memory strategies) are equivalent. 



Theorem 3. For all multi-weigthed two-player game structures G with dimension k, the answer 
to the unknown initial credit problem is Yes if and only if the answer to the mean-payoff threshold 
problem (for finite memory) with threshold vector {0}'^ is Yes. 

Proof. First, assume that there exists a winning strategy Ai for Player 1 in G for the multi energy 
objective PosEnergyfj(t;o) (for some vq). Theorem 2 estabhshes that finite memory is sufficient to 
win muhi-energy games, so we can assume that Ai has finite memory. Consider the restriction of 
the graph G\-^ to the reachable vertices, and we show that the energy vector of every simple cycle 
is nonnegative. By contradcition, if there exists a simple cycle with energy vector negative in one 
dimension, then the infinite path that reaches this cycle and loops through it forever would violate 
the objective PosEnergy(^(t;o) regardless of the vector vq. 

Now, this shows that every reachable cycle in G\^ has nonnegative mean-payoff value in all 
dimensions, hence Ai is winning for the multi mean-payoff objective MeanPayoff(j({0} ). 

Second, assume that there exists a finite-memory strategy Ai for Player 1 that is winning in G 
for the multi mean-payoff objective MeanPayoff(^({0} ). By the same argument as above, all simple 
cycles in G\^ are nonnegative and the strategy Ai is also winning for the objective PosEnergy(^(t'o) 
for some vq. Taking vq = {nVF} where n is the number of states in G\^ (which bounds the length 
of the acyclic paths) and W a Z is the largest weight in the game suffices. 

Note that the result of Theorem 3 does not hold for arbitrary strategies as shown in the following 
lemma. 

Lemma 4. In generalized mean-payoff games, infinite memory may be necessary to win (finite- 
memory strategies may not be sufficient). 

Proof. To show this, we first need to define the mean-payoff vector of arbitrary plays (because 
arbitrary strategies, i.e., infinite-memory strategies, may produce non-ultimately periodic plays). 
In particular, the limit of - • EL(7r(n)) for n — t- oo may not exist for arbitrary plays vr. Therefore, 
two possible definitions are usually considered, namely either MP (tt) = liminf„^oo^ • EL(7r(n)), 
or MP(7r) = limsup„_^o^ - • EL(7r(n)). In both cases, better payoff can be obtained with infinite 
memory: the example of Fig. 2 shows a game where all states belong to Player 1. We claim that 
(a) for MP , Player 1 can achieve a threshold vector (1,1), and (b) for MP, Player 1 can achieve 
a threshold vector (2,2); (c) if we restrict Player 1 to use a finite-memory strategy, then it is not 
possible to win the multi mean-payoff objective with threshold (1, 1) (and thus also not with (2, 2)). 
To prove (a), consider the strategy that visits n times qa and then n times qb, and repeats this 
forever with increasing value of n. This guarantees a mean-payoff vector (1, 1) for MP because in 
the long-run roughly half of the time is spent in qa and roughly half of the time in %• To prove (6), 
consider the strategy that alternates visits to qa and qb such that after the nth alternation, the 
self-loop on the visited state q {q £ {qa, qb}) is taken so many times that the average frequency of 
q gets larger than - in the current finite prefix of the play. This is always possible and achieves 
threshold (2,2) for MP. Note that the above two strategies require infinite memory. To prove (c), 
notice that finite- memory strategies produce an ultimately periodic play and therefore MP and M P 
coincide with MP. It is easy to see that such a play cannot achieve (1, 1) because the periodic part 
would have to visit both qa and qb and then the mean-payoff vector (fi,f2) of the play would be 
such that wi + f 2 < 2 and thus vi = f2 = 1 is impossible. 

Theorem 3 and Lemma 4, along with Theorem 2 gives the following result. 
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Fig. 2. A generalized mean-payoff game where infinite memory is necessary to win (Lemma 4). 



Theorem 4. Generalized mean-payoff games are determined under finite-memory, however deter- 
minacy and determined under finite-memory do not coincide for generalized mean-payoff games. 

4 coNP-completeness for Finite-Memory Strategies 

In this section, we present a nondeterministic polynomial time algorithm to recognize the instances 
for which there is no winning strategies for Player 1 in a multi-energy game. First, we show that 
the one-player version of this game can be solved by checking the existence of a circuit (i.e., a not 
necessarily simple cycle) with overall nonnegative effect in all dimensions. Second, we build on this 
and the memoryless result for Player 2 to define a coNP algorithm. The main result (Theorem 5) 
is derived from Lemma 6 and Lemma 7 below. 

Theorem 5. The unknown initial credit and the mean-payoff threshold problems for multi-weighted 
two-player game structures are coNP- complete. 

coNP upper bound. First, we need the following result about finding zero circuits in multi- 
weighted directed graphs (a graph is a one-player game). A zero circuit is a finite sequence sqSi . . . Sn 
such that So = ■^ni ("^i, Sj+i) G -E for all < i < n, and Yll=o ^(-^i) ^i+i) = (0) 0; • • • ) 0)- The circuit 
need not be simple. 

Lemma 5 ([13]). Determining if a k-dimensional directed graph contains a zero circuit can be 
done in polynomial time. 

Lemma 6. The unknown initial credit and the mean-payoff threshold problems for multi-weighted 
two-player game structures are in coNP. 

Proof. By Lemma 3, we know that Player 2 can be restricted to play memoryless strategies. A 
coNP algorithm can guess a memoryless strategy A and check in polynomial time that it is winning 
using the following argument. 

First, consider the graph Gx as a one-player game (in which all states belong to player 1. 
We show that if there exists an initial energy level vq and an infinite play vr = sqSi . . . s„ . . . in 
G\ such that vr € PosEnergy(wo) then there exist a reachable circuit in G\ that has nonnegative 
effect in all dimensions. To show that, we extend vr with the energy information as follows: vr' = 
{sq,wo){si,wi) . . . {sn,Wn) . . . where wq = vq and for all i > I, Wi = vq + EL(7r(i)). As tt G 
PosEnergy(wo), we know that for all i > 0, Wi ^ N^. So, we can define the following order on the 
pairs (s, w) G {S1US2) x N in the run: (s, w) Q {s', w') iff s = s' and w{j) < w'{j) for all 1 < j < k. 
From Lemma 1, it is easy to show that C is a wqo. Then there exist two positions ii < ^2 in vr' 



such that (sj^,ttJj^) 1^ (si^,Wi^). The circuit underlying those two positions has nonnegative effect 
in all dimensions. 

Based on this, we can decide if there exists an initial energy vector vq and an infinite path in 
Gx that satisfies PosEnergyQ(t;o) using the result of Lemma 5 on modified version of G\ obtained 
as follows. In every state of Gx, we add k self-loops with respective multi-weight (—1,0,... ,0), 
(0, —1, 0, . . . , 0), . . . , (0, . . . , 0, —1), i.e. each self-loop removes one unit of energy in one dimension. 
It is easy to see that Gx has a circuit with nonnegative effect in all dimensions if and only if the 
modified Gx has a zero circuit, which can be determined in polynomial time. The result follows. 

Lo'wer bound: coNP-hardness. We show that the unknown initial credit problem for multi- 
weighted two-player game structures is coNP-hard. We present a reduction from the complement 
of the 3SAT problem which is NP-complete [15]. 

Hardness proof. We show that the problem of deciding whether Player 1 has a winning strategy 
for the unknown initial credit problem for multi-weighted two-player game structures is at least as 
hard as deciding whether a 3SAT formula is unsatisfiable. Consider a 3SAT formula ip in CNF with 
clauses Ci, C2, . . . ,Gk over variables {xi, X2, . . . , a:„}, where each clause consists of disjunctions of 
exactly three literals (a literal is a variable or its complement). Given the formula ip, we construct a 
game graph as shown in Figure 3. The game graph is as follows: from the initial position. Player 1 
chooses a clause, then from a clause Player 2 chooses a literal that appears in the clause (i.e., 
makes the clause true). From every literal the next position is the initial position. We now describe 
the multi-weight labeling function w. In the multi-weight function there is a component for every 
literal. For edges from the initial position to the clause positions, and from the clause positions 
to the literals, the weight for every component is 0. We now define the weight function for the 
edges from literals back to the initial position: for a literal y, and the edge from y to the initial 
position, the weight for the component of ?/ is 1, the weight for the component of the complement 
of y is —1, and for all the other components the weight is 0. We now define a few notations related 
to assignments of truth values to literals. We consider assignments that assign truth values to all 
the literals. An assignment is valid if for every literal the truth value assigned to the literal and its 
complement are complementary (i.e., for all 1 < i < n, if Xi is assigned true (resp. false), then the 
complement Xj of Xj is assigned false (resp. true)). An assignment that is not valid is conflicting 
(i.e., for some I < i < n, both Xj and Xj are assigned the same truth value). If the formula ip is 
satisfiable, then there is a valid assignment that satisfies all the clauses. If the formula tp is not 
satisfiable, then every assignment that satisfies all the clauses must be confiicting. We now present 
two directions of the hardness proof. 

^p satisfiable implies Player 2 winning. We show that if ip is satisfiable, then Player 2 has a mem- 
oryless winning strategy. Since tp is satisfiable, there is a valid assignment A that satisfies every 
clause. The memoryless strategy is constructed from the assignment A as follows: for a clause Ci, 
the strategy chooses a literal as successor that appears in Ci and is set to true by the assignment. 
Consider an arbitrary strategy for Player 1, and the infinite play: the literals visited in the play 
are all assigned truth values true by A, and the infinite play must visit some literal infinitely often. 
Consider the literal x that appears infinitely often in the play, then the complement literal x is 
never visited, and every time literal x is visited, the component corresponding to x decreases by 1, 
and since x appears infinitely often it follows that the play is winning for Player 2 for every finite 
initial credit. It follows that the strategy for Player 2 is winning, and the answer to the unknown 
initial credit problem is "No". 
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Fig. 3. Game graph construction for a 3SAT formula (Lemma 7). 

^ not satisfiable implies Player 1 is winning. We now show that if ■(/' is not satisfiable, then Player 1 
is winning. By determinacy, it suffices to show that Player 2 is not winning, and by existence of 
memoryless winning strategy for Player 2 (Lemma 3) , it suffices to show that there is no memoryless 
winning strategy for Player 2. Fix an arbitrary memoryless strategy for Player 2, (i.e., in every clause 
Player 2 chooses a literal that appears in the clause). If we consider the assignment A obtained 
from the memoryless strategy, then since V' is not satisfiable it follows that the assignment A is 
conflicting. Hence there must exist clause Cj and Cj and variable Xk such that the strategy chooses 
the literal Xk in Cj and the complement variable x^ in Cj. The strategy for Player 1 that at the 
starting position alternates between clause Cj and Cj, along with that the initial credit of 1 for the 
component of Xk and Xj^., and for all other components, ensures that the strategy for Player 2 is 
not winning. Hence the answer to the unknown initial credit problem is "Yes", and we have the 
following result. 

Lemma 7. The unknown initial credit and the mean-payoff threshold problems for multi-weighted 
two-player game structures are coNP-hard. 

Observe that our hardness proof works with weights restricted to the set { — 1, 0, 1}. 

5 NP-completeness for Memoryless Strategies 

In this section we consider the unknown initial credit and the mean-payoff threshold problems for 
multi-weighted two-player game structures when Player 1 is restricted to use memoryless strategies. 
We will show NP-completeness for these problems. 

Lemma 8. The unknown intial credit and the mean-payoff threshold problems for multi-weighted 
two-player game structures for memoryless strategies for Player 1 lie in NP. 

Proof. The inclusion in NP is obtained as follows: the polynomial witness is the memoryless strategy 
for Player 1, and once the strategy is fixed we obtain a game graph with choices for Player 2 only. 
The verification problem for the unknown initial credit checks that for every dimension there is 
no negative cycle, and the verification problem for mean-payoff threshold checks that for every 
dimension every cycle satisfy the threshold condition. Both the above verification problem can be 
achieved in polynomial time by solving the energy-game and mean-payoff game problem on graphs 
with choices for Player 2 only [12, 3, 6]. The desired result follows. 
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Lemma 9 shows NP-hardness for dimension k = 2 and arbitrary integral weights, and is obtained 
by a reduction from the Knapsack problem. If /c = 1, then the problems reduces to the classical 
energy and mean-payoff games, and is in NP fl coNP [3, 6, 19] (so the hardness result cannot be 
obtained for k = 1). 

Lemma 9. The unknown intial credit and the mean-payoff threshold problems for multi-weighted 
two-player game structures for memoryless strategies for Player 1 are NP-hard, even in one-player 
game structures with dimension k = 2 for the weight function. 

Proof. We present a reduction from the KNAPSACK problem. The KNAPSACK problem consists of 
a set / = {1,2, . . . ,n} of n items, for each item i there is a profit pj € N and a weight Wi G N. 
Given a weight bound B and profit bound P, the Knapsack problem asks whether there exists 
a subset J C I of items such that (a) Y^j^jWj < B; and (b) Ylij&jPj — ^ i^-^n & profit of 
P can be accumulated without exceeding weight B). The Knapsack problem is NP-hard [15]. 
Our reduction is as follows: given an instance of the Knapsack problem we construct a one- 
player game structure with a weight function of dimension 2. The set of positions is as follows: 
5i = / U {{i,j)\i e I,j e {Y,N}} U {n + 1} and ^2 = 0. The set of edges is as follows: E = 
{{i,{i,Y)),{i,{i,N)) I i £ /}U{((i,y),i + l),((i,7V),i + l) | i G /} U {(n + 1, 1)}. Intuitively, in 
the game structure, for every item Player 1 has a choice of "Yes" (edge from i to {i,Y)) to select 
item i, and choice of "No" (edge from i to {i,N)) to not select item i. From {i,Y) and {i,N) the 
next position is i + 1, and from the position n + 1 the next position is 1. The weight function 
function w : E ^ W? has two dimensions: (a) for edge e = (i, (i, N)) we have w{e) = (0, 0) (i.e., for 
the choice of "No" all the weights are 0); (b) for an edge e = (i, {i,Y)) we have w{e) = {pi, —Wi) 
(i.e., for the choice of "Yes", the first component gains the profit and the second component loses 
the weight of item i); (c) for an edge e = {{i,Y),i + 1) or e = {{i,N),i + 1) we have w{e) = 0; 
and (d) for the edge e = (n + 1, 1) we have w{e) = {—P,B) (i.e., there is a loss of P in the first 
component and a gain of B in the second component). The construction is illustrated in Fig 4. 
Given a solution J for the Knapsack problem, the memoryless strategy that choose (j, {j,Y)) for 
j € J, and {j' , {j' , N)) for j' G / \ J, with intial credit (0, B) is a solution for the unknown initial 
credit problem. Conversely, given a memoryless strategy Ai for the unknown initial credit problem, 
the set J = {j G / I Ai(j) = {j,Y)} is a solution to the Knapsack problem. The argument for the 
mean-payoff threshold problem is analogous. The result follows. 

In Lemma 10 we show the hardness of the problem when the weights are in { — 1, 0, 1}, but the 
dimension is arbitrary. It has been shown in [11] that if the weights are { — 1, 0, 1} and the dimension 
is 2, then the problem can be solved in polynomial time. 

Lemma 10. The unknown intial credit and the mean-payoff threshold problems for multi-weighted 
two-player game structures for memoryless strategies for Player 1 are NP-hard, even in one-player 
game structures when weights are restricted to { — 1,0, 1}. 

Proof. We present a reduction from the 3SAT problem. Consider a 3SAT formula ^ over a set 
X = {xi, X2, • • • , Xn\ of variables, and a set Ci, C2, . . . , Cm of clauses such that each clause has 
3-literals (a literal is a variable or its complement). We construct a one-player game structure with 
a weight function of dimension m from <?. The set of positions is S*! = X U {(xj,j) | Xj G X, j G 
{T, F}} U {xn+i} and 5*2 = 0. The set of edges is as follows: E = {(xj, {xi,T)), (xj, (xi,F)) | Xj G 
X} U {((xj,T),Xi+i), ((xj, F),Xi+i) I Xj G X} U {(x„_|_i,xi)}. Intuitively, in the game structure, 
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Fig. 4. Knapsack Reduction. 

for every variable Player 1 has a choice to set Xi as "True" (edge from Xi to (xi,r)), and choice 
to set Xi as "False" (edge from xi to (xj,F)). From {xi,T) and {xi,F) the next position is Xj+i, 
and from the position Xn+i the next position is xi. The construction of the graph is similar as in 
Fig 4. The weight function w : E ^ 17^ has m dimensions: (a) for an edge e = (xj, (a;i,T)) (resp. 
e = (xj, (xj, F))) and 1 < A; < m, the /c-th component of w(e) is 1 if the choice Xj as "True" (resp. 
"False") satisfies clause Cfc, and otherwise the /c-th component is 0; (b) for edges e = ((xj, j), Xj+i), 
with J € {T, F}, every component of w{e) is 0; and (c) for the edge e = (x„_|_i, xi), for all 1 < A; < m, 
the /c-th component of i(;(e) = —1. If ^ is satisfiable, then consider a satisfying assignment A, 
and we construct a memoryless strategy Ai as follows: for a position Xj, if A{xi) is "True", then 
choose {xi,T), otherwise choose {xi,F). The memoryless strategy Ai with initial credit vector {0}™ 
ensures that the answer to the unknown initial credit problem for memoryless strategies is "Yes". 
Conversely, if there is a memoryless strategy Ai for the unknown initial credit problem, then the 
memoryless strategy must satisfy every clause. A satisfying assignment A for (p is as follows: A{xi) 
is "True" if Ai(xj) = {xi,T), and "False", otherwise. It follows that ^ is satisfiable iff the answer 
to the unknown initial credit problem for memoryless strategies is "Yes". The argument for the 
mean-payoff threshold problem is analogous. The desired result follows. 

The following theorem follows from the results of Lemma 8, Lemma 9 and Lemma 10. 

Theorem 6. The unknown initial credit and the mean-payoff threshold problems for multi-weighted 
two-player game structures for memoryless strategies for Player 1 are NP-complete. 

6 Conclusion 

In this work we considered games with multiple mean-payoff and energy objectives, and established 
determinacy under finite-memory, inter-reducibility of these two classes of games for finite-memory 
strategies, and improved the complexity bounds from EXPSPACE to coNP-complete. 

Two interesting problems are open: (A) for generalized mean-payoff games, the winning strate- 
gies with infinite memory are more powerful than finite-memory strategies, and the complexity of 
solving generalized mean-payoff games with infinite-memory strategies remains open. (B) it is not 
knwon how to compute the exact or approximate Pareto curve (trade-off curve) for multi-objective 
mean-payoff and energy games. 
Acknowledgement. We are grateful to Jean Cardinal for pointing the reference [13]. 
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